Noise robust speech parameterization based on joint wavelet packet decomposition and autoregressive modeling
Abstract
This paper presents a noise robust feature extraction algorithm that jointly uses wavelet packet decomposition (WPD) and autoregressive (AR) modeling of the speech signal. In contrast to the time-frequency signal representation based on the short time Fourier transform (STFT), a computationally efficient WPD can give a better representation of the non-stationary parts of the speech signal (consonants). Vowels are well described by an AR model, as in LPC analysis. The separately extracted WPD and AR based features are combined using modified principal component analysis (PCA) and a voiced/unvoiced decision to produce the final output feature vector. Noise robustness is improved by applying the proposed wavelet based denoising algorithm, which uses a modified soft thresholding procedure, together with voice activity detection. Speech recognition results on the Aurora 3 databases show a performance improvement of 47.6% relative to the standard MFCC front-end.
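As a rough illustration of the building blocks named in the abstract, the following sketch computes wavelet packet sub-band energies and AR coefficients for a single frame. It is not the authors' algorithm: the Haar wavelet, the plain soft threshold, the default values, the simple concatenation of the two feature sets, and the function names (`haar_wpd`, `soft_threshold`, `ar_coefficients`, `frame_features`) are all assumptions made for this example. The modified soft thresholding, the modified PCA, the voiced/unvoiced decision, and the voice activity detection described in the paper are not reproduced.

```python
import numpy as np

def haar_wpd(x, levels):
    """Full Haar wavelet packet tree; returns the 2**levels leaf sub-bands.

    len(x) must be divisible by 2**levels. Leaves are in natural (tree)
    order, not sorted by frequency.
    """
    nodes = [np.asarray(x, dtype=float)]
    for _ in range(levels):
        nxt = []
        for n in nodes:
            even, odd = n[0::2], n[1::2]
            nxt.append((even + odd) / np.sqrt(2.0))  # low-pass branch
            nxt.append((even - odd) / np.sqrt(2.0))  # high-pass branch
        nodes = nxt
    return nodes

def soft_threshold(c, thr):
    """Standard soft thresholding: shrink magnitudes toward zero by thr."""
    c = np.asarray(c, dtype=float)
    return np.sign(c) * np.maximum(np.abs(c) - thr, 0.0)

def ar_coefficients(x, order):
    """AR model via the autocorrelation method and Levinson-Durbin.

    Returns (a, err) with a[0] == 1, so that
    x[n] + a[1]*x[n-1] + ... + a[order]*x[n-order] is the prediction error.
    """
    x = np.asarray(x, dtype=float)
    r = np.array([np.dot(x[:len(x) - k], x[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err                     # reflection coefficient
        prev = a.copy()
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err

def frame_features(frame, levels=3, ar_order=10, thr=0.01):
    """Hypothetical feature vector: log sub-band energies plus AR coefficients."""
    bands = [soft_threshold(b, thr) for b in haar_wpd(frame, levels)]
    log_energy = np.log(np.array([np.sum(b ** 2) for b in bands]) + 1e-12)
    a, _ = ar_coefficients(frame, ar_order)
    return np.concatenate([log_energy, a[1:]])
```

For a 256-sample frame, `frame_features(frame)` with the defaults above returns 8 log sub-band energies followed by 10 AR coefficients. In practice a longer wavelet filter and a noise-dependent threshold would likely be used; the Haar filter is chosen here only to keep the sketch dependency-free.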
Similar resources
A Comprehensive Noise Robust Speech Parameterization Algorithm Using Wavelet Packet Decomposition-Based Denoising and Speech Feature Representation Techniques
This paper concerns the problem of automatic speech recognition in noise-intense and adverse environments. The main goal of the proposed work is the definition, implementation, and evaluation of a novel noise robust speech signal parameterization algorithm. The proposed procedure is based on time-frequency speech signal representation using wavelet packet decomposition. A new modified soft thre...
A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, many non-speech signals may be classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
A Generalized Time–Frequency Subtraction Method for Robust Speech Enhancement Based on Wavelet Filter Banks Modeling of Human Auditory System
We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized tim...
A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
The quality of a speech signal degrades significantly in the presence of environmental noise, which impairs the performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, single-channel enhancement of speech signals corrupted by additive noise is considered. A dictionary-based algorithm is proposed to train the speech...
Robust Speech Perception Hashing Authentication Algorithm Based on Spectral Subtraction and Multi-feature Tensor
To give the speech perceptual hashing authentication algorithm strong robustness and discrimination under content-preserving operations and speech communication with common background noise, a new robust speech perceptual hashing authentication algorithm based on spectral subtraction and a multi-feature tensor is proposed. The proposed algorithm uses the spectral subtraction method to...